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Abstract 

Molecular mimicry defined as similar structures 
shared by molecules from dissimilar genes or by their 
protein products, is a general strategy for pathogens 
to infect host cells. Therefore, identification of the 
molecular mimic regions of a pathogen may be helpful 
to understand the disease. Severe acute respiratory 
syndrome (SARS) is a new human_ respiratory 
infectious disease caused by SARS coronavirus 
(SARS-CoV). The virus uses the spike (S) protein to 
interact with the angiotensin converting enzyme 2, the 
host cell receptor. Our approach is to design a 
workflow with multiple bioinformatics tools in 
analyzing the sequence of spike protein of the SARS- 
CoV in searching its similarity to human proteins. 
Furthermore, eleven peptides have been synthesized to 
validate the in silico results. 


1. Introduction. 


Severe Acute Respiratory Syndrome” (SARS) is a 
new emerging infectious disease, which was first 
reported in China in 2002 [1, 2]. The disease is 
caused by a novel coronavirus (SARS-CoV) [3, 4, 5]. 
SARS-CoV is a positive-stranded RNA virus. The 
genome of SARS-CoV is around 29,727 nucleotides in 
length. The sequence was annotated in silico [6]. 
According to the genomic sequence of SARS-CoV, it 
is predicted that SARS-CoV produces _ several 
structural proteins including spike (S), envelop (E), 
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membrane (M), and nucleocapsid proteins. S protein 
is a key molecule for the initial viral entry into human 
cells. The majority of S protein (residues 12-1195) is 
outside the virus particle, which can be divided into 
amino-terminal S1 and carboxyl-terminal S2 domain. 
The S1 domain (residues 12-672) binds to the host cell 
receptor, angiotensin-converting enzyme 2 (ACE2) 
while the S2 domain is responsible for membrane 
fusion [7, 8, 9]. 

Molecular mimicry, which is defined as similar 
structures shared by molecules from dissimilar genes 
or by their protein products, is a general strategy for 
pathogens to infect host cells and has been proposed as 
a pathogenic mechanism for autoimmune disease [10]. 
Therefore, identification of the molecular mimic 
regions of pathogen may be helpful to understand the 
disease induced by that pathogen. At present, it is 
unclear whether molecular mimicry occurs between 
SARS-CoV S proteins and human peptides. We have 
approached this question by in silico analyzing the 
sequence of spike protein of SARS-CoV and select 
regions that share the sequence homology with human 
proteins. Synthetic peptides were synthesized to 
validate the prediction. 


2. Materials and Methods. 


2.1. Peptide prediction and synthesis 


Publically available human and_ coronavirus 
genome sequences at the National Center for 
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Biotechnology Information USA were used for in 
silico prediction. Immunogenic viral peptides were 
calculated based on the algorithm developed by 
Kolaskar and Tongaonkar [11]. Jn silico secondary 
structural analyses of spike protein were performed 
based on PHD [12] and PREDATOR [13] algorithms. 
Protein topology prediction was based on_ the 
algorithm developed by TMHMM [14]. 
Hydrophobicity of the peptides was calculated based 
on the algorithh HMOMENT [15]. Similarity 
searches between S protein and human genome 
database were performed by using BLASTP [16]. 
Extra amino acid residues were added at either N- or 
C- terminus to decrease the hydrophobicity. Multiple 
antigen peptides were synthesized by CytoMol Corp 
(Mountain View, CA, USA). 


2.2 Mice immunization 


Six to eight-week-old female BALB/c mice were 
originally purchased from Jackson Laboratory (Bar 
Harbor, ME, USA) and bred in the Laboratory Animal 
Center, National Cheng Kung University. Synthetic 
peptides were emulsified with complete Freund’s 
adjuvant and injected intraperitoneally. Mice were 
boosted intraperitoneally two weeks after priming. 
Sera were collected from the axially plexus of the mice 
at different time intervals. 


2.3 SDS-PAGE and Western blot analysis 


Proteins in the cell lysate of A549 were separated by 
12% SDS-PAGE and transferred to nitrocellulose 
sheets. Proteins recognized by normal or peptide D08 
hyperimmune mice sera were detected by using HRP- 
conjugated anti-mouse immuno- globulin antibodies 
and substrates. 


3. Results. 


3.1 Search for molecular mimic regions in S 
protein. 


The whole amino acid sequence of spike protein 
was analyzed to find out the potential immunogenic 
regions and the regions shared sequence homology 
with human proteins, which is defined as_ the 
pathogenic regions. As shown in Figure 1, there are 4 
pathogenic regions. Region 1 (residues 199-254), 
region 2 (residues 658-715), region 3 (residues 893- 
941), and region 4 (residues 1127-1184) have shared 
sequence homology with hydroxyacid oxidase, human 
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golgi autoantigen, angrgm-52, and _pallidin, 
respectively. Among these regions, region 3 has the 
highest score. 


MFIFLLFLTLTSGSDLDRCTT FDDVQAPNY TQHTSSMRGVY Y PDEIFRSDTLY LTQDLFLPFYSNVTGFH 
TINHTFGNPVIPFKDGIY FAATEKSNVVRGWVEGS TMNNKSOQSVIIINNSTNVV IRACNFELCDNPFFAV 

SK PMGTQTHTMIFDNAENCT FEY ISDAFSLDVSEKSGNFKI ILREFVFKNKDGFLYVY Kin 
SGFNTLKPIFKLPLGINITNERAILTAFSPAQDIWGTSAAAYEVGYL KPTTFMLKYDENGTIT DAVDCSQ 
NPLAELKCSVKSFEIDKGIY QTSNFRV VPSGDVVREPNITNLCPFGEVFNATKEPSV YAWERKKISNCVA 
DYSVLYNSTFFSTFKCY GVSAT KLNDLCFSNV YADSFVVKGDDVROQIAPGQTGVIADYNY KLPDDFMG 
CVLAWNTRNIDATSTGNYNY KYRYLRHGKLRPFERDISNVPFSPDGKPCTPPALNCY WPLNDYFYTTTGI 
GYQPYRVVVLSFELLNAPATVCGPKLSTDLIKNQCVNENFNGLTGTGVLTPSSKRFQPFQQFGR 
DVSDFTDSVRDPKTSEILDISPCAFGGVSVITPGTNASSEVAVLY QDVNCTDVSTAIHADOQLTPAWRIYSTG 
NNVFOTOAGCLIGAEHVDTSYECDIPIGAGIC IIR SO KSIVAYTMSLGADSSIAYSNNTIAIPTNE 
SISITTEVMPVSMAKTSVDCNMYICGDST A cD 


YKTPTLKYFGGFNFSOILPDPLK PTKRSFIE DLLFNK VTLADAGFM KOY GECLGDINARDLICAQKFNGL 


TVLPPLLTDDMIAAYTAALVSGTATAGWTFGAGAALQIPFAMQMAY RENGIGV TONVLYENQKQI ANOEN 
EE rs oI «1s visoiseivkveacvowentr 
GRLOSLOTYVTQQLIRAAEIRAS ANLAATKMSECVLGOQSKRVDFCGKGYHLMSFPQAAPHGVVFLHVTYV 


PSQERNFTTAPAICHEGKAY FPREGVFVENGTSWFIT QRNFFSPQIITTDNT FVSGNCDVVIGIINNTVY 
DPLOPELDSFKEELDKYFKNHTSPDVDLGD1SCRMASMAIKEIDRLNEVAKNLNESLIDLOELOKY EO 
> WAAAVEGRIAGIIAIM VTL LCCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGVKLHYT 


Figure 1 Jn silico analysis of S protein. 

S protein amino acid sequence was analyzed to find 
immunogenic regions (yellow regions) and pathogenic 
regions (regions with shared sequence homology with 
human proteins, blue regions). Purple regions are both 
immunogenic and pathogenic regions. Grey region is the 
leader sequence and brown region is the transmembrane 
region. 


In addition, because des-Arg bradykinin and Ang I 
are the substrates for ACE2 [17], we also compared 
the sequence of S protein against bradykinin and Ang I 


and found that residues 490-502 
(GYQPYRVVVLSFEE) of S_ protein showed 
sequence homology with bradykinin (Figure 2). 
Spike protein GYQPYRVVVLS 
| : 2 | a | * @ = e| a 
Bradykinn =§_—s GPF SPFRSSRIG 


Figure 2 Sequence homology between S protein 
and bradykinin. 
Pair wise alignment was calculated based on Smith- 
Water local alignment with matrix set at EBLOSUM 45. 
7 and “.” are for 


66,99 


is annotated for identical residues; 
similar residues. 


3.2 In silico predicted pathogenic peptides. 
Eleven antigenic peptides (Table 1), which 
represent those pathogenic regions were predicted and 


synthesized. 


Table 1. Sequence of the eleven synthetic peptides. 


” Extra amino acid residues which were indicated by italic 
letters were added at either N- or C-terminus to decrease the 


hydrophobicity. 
Peptide Amino acid positions Amino acid sequence No. of amino acids 
DO1 199-210 GYQPIDVVRDLG 12 
DO2 658-669 ASYHTVSLLRST SQK 15 
DO3 733-744 EEGNLLLQYGSFCTQ is 
D04 745-753 EELNRALSGIAGQ 13 
DO5 763-770 VFAQVKQM 8 
DO6 911-919 KAISQIQESLTTE 13 
DO7 927-937 GLGKLQDVVNQNGE 14 
DO8 942-951 ALNTLVKQLSSN 12 
DOS 1154-1162 INASVVNIQK 10 
D10 490-502 GYQPYRVVVLSFEE 14 
D11 306-317 GFRVVPSGDVVRF 13 


3.3. Hyperimmune sera against D08 cross- 
reacted with A549 cells 


To test whether synthetic peptides DO1, DO7 DO8 
can induce antibodies cross-reacted with human 
proteins, we immunized mice with these peptides to 
generate hyperimmune sear against these peptides. 
Using Western blot analysis, hyperimmune sera 
against DO8 could recognize more bands in A549 cell 
lysate as compared to normal mice sera (Figure 3). In 
addition, hyperimmune sera against DO7 but not DOI 
showed similar cross-reactivity to A549 cells as 
hyperimmune sera against D08 did (data not shown). 


—_ 
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Figure 3 Anti-D08 peptide sera recognizes protein 
from human A549 cell line. 


3.4 Hyperimmune sera against D10 cross- 
reacted with bradykinin 


To test whether the synthetic peptides D10 indeed 
can induce antibodies cross-reactive with bradykinin 
and Ang I, as predicted, we immunized mice with D10 
peptides to generate hyperimmune sera against this 
peptide. Significant increase of antibodies against D10 
was found in D10 hyperimmune sera, which could 
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cross-react with bradykinin- but not with Ang I-coated 
plates (Figure 4). 
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Figure 4 The cross-reactivity of D10 antibody with 
bradykinin and Ang I. 

Hyperimmune sera from D10 immunized mice (m) or 
normal mice sera (A) were diluted as indicated and reacted 
with D10-, bradykinin- or Ang I-coated ELISA plates as 
indicated. Bound antibodies were detected as described in 
Materials and Methods. Data represents the mean + SD of 
triplicates. 


4. Discussion. 


In this study we have identified four pathogenic 
regions of SARS-CoV S protein which share sequence 
homology with different human proteins. Among them, 
pathogenic region 3 (residues 893-941), which shares 
sequence homology with angrgm-52 (GenBank 


accession no. AAL62340), a novel gene up-regulated 
in human mesangial cells stimulated by angiotensin I, 
may deserve further investigation. Peptides DO7 and 
DO8 of this region were recognized by the sera of 
SARS patient indicating that this region is 
immunogenic and can be recognized by the immune 
system during SARS-CoV infection. Murine 
hyperimmune sera against peptides DO7 or DO8 were 
able to bind to recombinant S2 but not S1 domain of S 
protein (data not shown). In addition, hyperimmune 
sera against DO7 or DO8 also bound to the cytoplasmic 
region of A549 cells and recognized several proteins 
in the A549 cell lysate. These results indicate that 
regions represented by DO7 and D08 are immunogenic 
and may induce autoantibodies. However, further 
study is required to understand the biological function 
of these regions and the role of their antibodies in the 
pathogenesis of SARS-CoV infection. 

In addition to DO7 and DO8 peptides, we also 
noticed D10 peptide which represents residues 490- 


502 of S1 domain contained some interesting activities. 


The D10 peptide, which shared sequence homology 
with bradykinin, was able to generate antibodies cross- 
reactive with bradykinin. In addition, D10 peptide 
could stimulate A549 to produce IL-8 and 
proliferation as Ang I did. These results suggest that 
the region of D10 in S protein may bind to Ang I 
receptor, ACE2, and may be involved in the binding of 
SARS-CoV to ACE2. This is consistent with the 
previous report, which indicates that residues 318-510 
of S1 domain can bind to ACE2 [18] and is similar to 
the receptor binding domain of the HCoV-229E, 
which is within a fragment containing residues 407 to 
547 [19]. Therefore, region 490-502 of SI domain 
may be involved in the receptor binding domain of 
SARS-CoV. 

In summary, our results suggest that molecular 
mimicry occurs between SARS-CoV and host proteins. 
Motifs shared sequence homology with host proteins 
of SARS-COV may be involved in the binding and 
fusion of SARS-CoV to host cells. Antibody against 
these motifs may contain neutralization activity against 
SARS-CoV infection or _ participate in the 
immunopathogenesis induced by SARS-CoV. Small 
molecules derived from molecular structure of these 
motifs may provide alternative approaches to disrupt 
the infection of SARS-CoV [20]. 
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